Analysis of phonetic transcriptions for Danish automatic speech recognition
Abstract
Automatic speech recognition (ASR) relies on three resources: audio, orthographic transcriptions and a pronunciation dictionary. The dictionary, or lexicon, maps orthographic words to sequences of phones or phonemes that represent the pronunciation of the corresponding word. The quality of a speech recognition system depends heavily on the dictionary and the transcriptions therein. This paper presents an analysis of phonetic/phonemic features that are salient for current Danish ASR systems. This preliminary study consists of a series of experiments using an ASR system trained on the DK-PAROLE corpus. The analysis indicates that transcribing features such as stress or vowel duration has a negative impact on performance. The best performance is obtained with coarse phonetic annotation, which improves word error rate by 1% and sentence error rate by 3.8%.
Similar resources
Application-oriented validation of phonetic transcriptions: preliminary results
There is an increasing need for automatic procedures to generate and validate phonetic transcriptions. As the production of manual phonetic transcriptions tends to be time-consuming, error-prone and costly, procedures have been developed to derive phonetic transcriptions automatically by means of automatic speech recognition technology. Such automatic phonetic transcriptions are usually validat...
Application-oriented validation of phonetic transcriptions: preliminary results
There is an increasing need for automatic procedures to generate and validate phonetic transcriptions. As the production of manual phonetic transcriptions tends to be time-consuming, error-prone and costly, procedures have been developed to derive phonetic transcriptions automatically by means of automatic speech recognition technology. Such automatic phonetic transcriptions are usually val...
Automatic generation of phonetic transcriptions for large speech corpora
We describe a method for the automatic production of phonetic transcriptions in large speech corpora. First, we focus on the application of different techniques for the generation of pronunciation variants. Then, we explain the application of a speech recognition system for selecting the acoustically best matching phonetic transcription. The system is evaluated on different test sets selected f...
Validation of phonetic transcriptions based on recognition performance
In fundamental linguistic as well as in speech technology research there is an increasing need for procedures to automatically generate and validate phonetic transcriptions. Whereas much research has already focussed on the automatic generation of phonetic transcriptions, far less attention has been paid to the validation of such transcriptions. In the little research performed in this a...
Validation of phonetic transcriptions in the context of automatic speech recognition
Some of the speech databases and large spoken language corpora that have been collected during the last fifteen years have been (at least partly) annotated with a broad phonetic transcription. Such phonetic transcriptions are often validated in terms of their resemblance to a handcrafted reference transcription. However, there are at least two methodological issues questioning this validation m...
Publication date: 2013